Detecting q- Gaussian distributions and the normalization effect 
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We show that whenever data are gathered using a device that performs a normalization-preprocessing, the en- 
suing normalized input, as recorded by the measurement device, will always be q-Gaussian distributed if the 
incoming data exhibit elliptical symmetry. As a consequence, great care should be exercised when "detecting" 
q-Gaussians. As an example, Gaussian data will appear, after normalization, in the guise of q-Gaussian records. 
Moreover, we show that the value of the resulting parameter q can be deduced from the normalization technique 
that characterizes the device. 
PACS: 05.40.-a, 05.20.Gg, 02.50.-r 



INTRODUCTION 

Systems statistically described by power-law probability dis- 
tributions (PLD) are rather ubiquitous HI] and thus of peren- 
nial interest J2, 0, 01 . Indeed, many objects that come in 
different sizes have a self-similar power-law distribution of 
their relative abundance over large size-ranges, from cities 
to words to meteorites (!]]. Now, PLDs under variance con- 
straint maximize a non-logarithmic information measure, of- 
ten called Tsallis' entropy or q-entropy H q 
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This measure tends to the celebrated Shannon entropy in the 
limit q — > 1 J2, 0. Systems for whose description H q is 
relevant have received intense attention in the last years, with 
more than 1200 papers extant and hundreds of authors |0]. To 
a large extent, the relevance of the concomitant treatments re- 
lies on the fact that one often confronts a particular scenario: 
measuring real data distributed according to a q-Gaussian 
probability law, a special kind of power-law probability dis- 
tribution function (PDF). Consider a system S described by a 
vector X with d components whose covariance matrix reads 



K = {XX 1 ) = EXX\ 



(2) 



the superscript t indicating transposition. We say that X is 
q— Gaussian distributed if its probability distribution function 
writes in one of the two forms to be found below |2l |3j |4J] : 
for i < q < |±4 
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Matrix A is related to the covariance matrix K according to 
A = (m — 2) K, where the number of degrees of freedom rn 
is defined O] in terms of the dimension d of X as m = -=r — 

1 - J q—l 

d. Instead, in the case q < 1 the q— Gaussian distribution is 



fx, q (X) = 
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where the matrix S is related to the covariance matrix via 
S = pK. We introduce here a parameter p defined as 
p = 2j5| + d an d use the notation = max(x, 0). The 
focus of the present endeavor revolves around the influence of 
the measurement device on the distribution of these data. The 
many authors cited above together with their readers should 
find the considerations developed below, concerning the influ- 
ence of the so-called normalization stage on the performance 
of a measurement device, of great interest B12I1 . 
Most measurement devices consist of a preprocessing stage 
that prevents the rest of the device to be provided with data 
of exceedingly large amplitude that would cause damage to 
the hardware. Since most of measured data are of stochastic 
nature, the concomitant most natural and common technique 
is to statistically normalize these input data. Since quite of- 
ten the relevant statistical properties are of unknown charac- 
ter, the normalization process consists of two steps: the data 
are first centered by substraction of their estimated mean, and 
then scaled by division by their estimated standard deviation. 
In what follows, we detail these operations, their statistical 
consequences, and the rather surprising impact the procedure 
may have with regards to non-extensive q-considerations. 



normalization measurement 
Figure 1 : a measurement device 

THE CASE OF MULTIVARIATE GAUSSIAN DATA 
Both mean and variance unknown 

Assume that we have n observations {Xi} 1<i<n of identically 
distributed data, each Xi being a vector in M p , and that neither 
the mean p nor the covariance matrix E are known. A first 
step of the normalization process consists in centering the data 
by substraction, from each Xu of an estimate p, of its mean 
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devised as follows 



and 



n 
3=1 



(5) 



We note that this estimate coincides with the maximum like- 
lihood one if the data are of Gaussian nature. Let us denote, 
with 1 < i < n, the residuals 



£i — Xi 



I' 
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so that 



Ee, 



The next step is the scaling of these residuals by the unbiased 
estimate of the (p x p) covariance matrix 



n 
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Then the normalized version of vector Xj, denoted as 
writes 



Y = £~ e, 



3=1 



(8) 



We assume here that n > p + 1 so that the estimated co- 
variance matrix £ is positive definite with probability one @|. 
This procedure is called internal Studentization, "internal" re- 
ferring to the fact that the estimated covariance matrix E is 
built upon all available data Xj, 1 < j < n, including the 
vector Xi to which it is applied. Another procedure is the so- 
called "external Studentization": it consists in normalizing 
the data Xi using an estimate of the covariance matrix that 
involves all data except Xi, according to 



1 n 



(9) 



so that we denote by 



(10) 



the resulting normalized data. It turns out that the distribu- 
tion of normalized data Y^ and Zi can be explicitly computed 
when the measured data Xi are Gaussian, as follows from the 
theorem below due to Diaz-Garcia et al. Jit] . 

Theorem 1. [Diaz-Garcia] suppose that matrix X = 
\Xi , . . . , X n ] is Gaussian-A/" (p <8> l n , In <8> £) —distributed 
i l75l/ w/f/z 1„ = [1, . . . , 1] S R". Then the normalized data Yi 
and Zi are q-Gaussian distributed 
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We remark that in the case of internal Studentization, the nor- 
malized data have bounded support . This corresponds to a 
"hard normalization" strategy that ensures the boundedness of 
the data that feed the measurement device. On the other hand, 
external Studentization corresponds to a "soft normalization" 
strategy in which boundedness of the data is not crucial to the 
proceedings. 

Moreover, a third strategy, that may be called "full external 
Studentization", is considered in |7j]: in order to normalize 
vector Xi, the estimate of the covariance matrix S and of the 
mean /i are built with all available data but Xi. This can be 
the case whenever the measurement device builds estimates 
on a batch basis. As a consequence, the estimated mean is 



1 



and the estimated covariance matrix is 



1 " 
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In such a scenario the following result holds 01. 

Theorem 2. [Eaton ] Suppose that matrix X = [X\ , . . . , X n ] 
is Gaussian Af (fJ, ® l n , I n ® S) —distributed. Then, the ran- 
dom vector 



V t = (X t - 
has probability distribution function (pdf) 
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Proof. See the proof in [Q], where it is qualified as a "routine 
multivariate calculation". □ 



Unknown variance 

In some contexts one may assume that the mean of the data 
is known. Thus, by replacing Xi by Xi — fi one may state, 
without loss of generality, that the mean equals zero. The 
two strategies - internal or external Studentization - remain 
possible, except that fi should now be replaced by in ([8]), 
( TTOl ). and ©. The distributions of the normalized data are 
expressed as follows. 
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Theorem 3. Under the same hypotheses as in Th\l\ and as- 
suming that the expectation of the data vanishes, the normal- 
ized data Yi and Z A are g-Gaussian distributed 



with 



fy< on 
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We remark that the known mean distributions (TT~5T > and (fTBT l 
can be recovered from the unknown mean ones ( fTTT > and dT~2T > 
by replacing the parameter n by n — 1. 



EXTENSION TO ELLIPTICAL MATRIX DISTRIBUTIONS 

The class of elliptically distributed random matrices plays an 
important role in statistics . Let us devote a few words to 
the concept of elliptical symmetry, a generalization of the cel- 
ebrated spherical symmetry to which the multi-normal distri- 
bution belongs. Spherical symmetry, that is invariance against 
rotations, found in the fundamental laws of nature, constitutes 
one of the most powerful principles in elucidating the struc- 
ture of individual atoms, complicated molecules, entire crys- 
tals, and many other systems. Elliptical distributions have re- 
cently gained a lot of attention in financial mathematics, being 
of use particularly in risk management. In what follows, we 
restrict our attention to the subset of elliptical matrices which 
have absolutely continuous distributions. The pertinent defi- 
nition reads as follows ifioll . 

Definition. A (p x n) random matrix X has a ma- 
trix variate, elliptical contoured distribution, denoted as 
i h) , if its probability distribution function 



(M, S 



writes 



fx 



(X) 
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tr ((X - Mf Yr l {X - M)) 



(17) 



where the dimensions of the involved matrices are, respec- 
tively, T : (p x n) ,M : (p x n) , S : (p x p) , <£> : (n x n) . 
Moreover, matrices £ and <£> are definite positive and function 
h : [0, oof — > R + is called the density generator of X. In what 
follows, we restrict our attention to the case $ = /„. 

A very important result of this paper can be phrased as fol- 
lows: 

Theorem 4. Theorem\l\\^and\3\still hold under the general 
assumption that X ~ £ p „ (M, £ (g) /„, h) . 

Proof. By proper scaling, it suffices to consider X ~ 
£p,n (0,lp,n,h) : in this case, a stochastic representation of 
X is, by lemma ([T|i below 



U 



N 



(19) 



N being a Gaussian J\f n , P (0, I n ,p) matrix. Since any of the 
vectors Yi, Zi and Vi given by equations ([8]), ( fTOb and ( fT3l are 
homogeneous functions of order of the data Xi, we deduce 
that Gaussian data can be replaced by uniform data according 
to ( fT9l ). and thus by elliptical data according to (1181 . □ 

Note that the proof can also deduced from the more general 
Thm. 5.3.1 in d. 

Moreover, this result can be generalized to a wider class of 
distributions by noticing that the distribution of the normal- 
ized residuals does not depend on the covariance matrix £ of 
the data. Consequently, matrix E can be randomly chosen 
within the set of positive definite matrices, independently of 
the data. We thus can state the following 

Theorem 5. The result of theorem \3\ extends to the general 
case X ~ £ Pin (M, S £g> /„ , h) with a random and positive 
definite matrix S. 



DISCUSSION AND CONCLUSIONS 

The following sketch illustrates the main result of this paper. 

, external 
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Studentization 

internal 

Figure 2: the effect of Studentization 



In other words, any of the three scaling preprocessing oper- 
ations described above maps the set of elliptically invariant 
distributions on the set of q— Gaussian distributed data. De- 
pending on the scaling method used, the resulting value of the 
parameter q is given as follows. 



Studentization method 


internal 


external 


full external 


value of q 


n — p — 3 


n + l 1 
n — 1 


n + l : 

n — 1 



Table I: Values of parameter q according to the normalization proce- 
dure: case where both mean and covariance are unknown 



Studentization method 


internal 


external 


value of q 


n—p—A ^ -i 
n-p-4 v 1 


n + 2 : 

n 



Table II: Values of parameter q according to the normalization pro- 
cedure: case where only the covariance is unknown 



X = rU 



(18) 



Two remarks are of interest at this point: 
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1. observation of the data after the normalization stage 
does not allow to infer the distribution of the data: 
in particular, putative Gaussian data are systematically 
transformed into q— Gaussian data. 

2. in the same way, q— Gaussian data are transformed into 
q'— Gaussian data, with parameter q' given by one of 
the values in Table 1, depending on the normalization 
procedure: this means that the real value of parameter q 
is "erased" by the normalization process. 

As a conclusion, the origin of q— Gaussian data should be 
carefully analyzed, since they may occur, for a very large set 
of recorded data (namely the set of elliptical ones), as a sim- 
ple consequence of a statistical normalization step. In other 
words, the putative "q— Gaussianity" may be a mere artifact 
of the statistical normalization step: caution is to be exercised. 

As an example, a measured value of the nonextensivity pa- 
rameter q appears in 014l p.230] in the context of financial 
markets as follows: 

"... returns (once demeaned and normalized 
by their standard deviation) have a distribution 
that is very well fit by q-Gaussians with q w 1.4." 

The estimated value q = | corresponds to an external Stu- 
dentization with n = 6 data. Now, the general result from our 
theorem [5] above suggests that the "real" distribution of these 
data may well indeed differ from the estimated distribution. 

ANNEX: STOCHASTIC REPRESENTATION 

Lemma 1. If X ~ £ Ptn (M, S £g> I n , h) then the stochastic 
representation 

A = M + r^U 

holds where U is a uniform matrix on the manifold of (p x n) 
matrices with unit Frobenius norm (see Footnote 2 ) hl(M . and 
r is a positive random variable independent of matrix U. 

Proof. Associate to A (n x p) the vector a = vec (A) € M. np 
so that ||a|| 2 = ||A|| 2 and 

/ a (a)^/(||a|| 2 ). 

Thus a stochastic representation for vector a is II 1311 a = ru 
where u is uniformly distributed on the sphere in R np and r is 
positive and independent of u: thus u writes 



where g is a (n x p) Gaussian vector; we deduce that 

A = rU 



where U is obtained by stacking the columns of u in a (n x p) 
matrix form so that 



y/tr (G*G) 

where G is a Gaussian (n x p) matrix, so that U is uniform 
on the set of (n x p) matrices with unit norm. □ 
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